Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method

نویسندگان

  • Yabin Zheng
  • Lixing Xie
  • Zhiyuan Liu
  • Maosong Sun
  • Yang Zhang
  • Liyun Ru
چکیده

Chinese Pinyin input method is very important for Chinese language information processing. Users may make errors when they are typing in Chinese words. In this paper, we are concerned with the reasons that cause the errors. Inspired by the observation that pressing backspace is one of the most common user behaviors to modify the errors, we collect 54, 309, 334 error-correction pairs from a realworld data set that contains 2, 277, 786 users via backspace operations. In addition, we present a comparative analysis of the data to achieve a better understanding of users’ input behaviors. Comparisons with English typos suggest that some language-specific properties result in a part of Chinese input errors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chinese Pinyin Input Method for Mobile Phone

Chinese input method is one of the most difficult problems in Chinese Language Processing. And to input Chinese word in mobile phone effectively is an even bigger challenge. In this paper, we propose a new Chinese pinyin input method in mobile phone. This method uses a compact statistical bigram based language model. Also, to meet the special requirements of Chinese pinyin input in mobile phone...

متن کامل

CHIME: An Efficient Error-Tolerant Chinese Pinyin Input Method

Chinese Pinyin input methods are very important for Chinese language processing. In many cases, users may make typing errors. For example, a user wants to type in “shenme” ( , meaning “what” in English) but may type in “shenem” instead. Existing Pinyin input methods fail in converting such a Pinyin sequence with errors to the right Chinese words. To solve this problem, we developed an efficient...

متن کامل

手機平台 APP 之四縣客語輸入法的研發 (Research and Implementation of Sixian Hakka Pinyin Input Method for Mobile Cell APP) [In Chinese]

The proposal scheme called Hakka pinyin input method is based on Android (IMF) Input Method Framework. Users can input Hakka texts in any APP of mobile cell. When user inputs a Hakka character or Hakka vocabulary phonetic abbreviation, the input method will refer to the input of user and search for a single character phonetic transcription font stored in the SQLite database. The data will send ...

متن کامل

A Unified Approach to Transliteration-based Text Input with Online Spelling Correction

This paper presents an integrated, end-to-end approach to online spelling correction for text input. Online spelling correction refers to the spelling correction as you type, as opposed to post-editing. The online scenario is particularly important for languages that routinely use transliteration-based text input methods, such as Chinese and Japanese, because the desired target characters canno...

متن کامل

A Joint Graph Model for Pinyin-to-Chinese Conversion with Typo Correction

It is very import for Chinese language processing with the aid of an efficient input method engine (IME), of which pinyinto-Chinese (PTC) conversion is the core part. Meanwhile, though typos are inevitable during user pinyin inputting, existing IMEs paid little attention to such big inconvenience. In this paper, motivated by a key equivalence of two decoding algorithms, we propose a joint graph...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011